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INTRODUCTION 

PTEN/MMAC1/TEP1  (referred  as  PTEN  in  the  remaining  text  of  this  paper)  is  a  tumor  suppressor  gene 
that  maps  to  the  10q23.3  and  has  been  shown  to  be  deleted  or  mutated  in  many  human  tumors  including 
glioblastomas,  endometrial  neoplasms,  hematological  malignancies,  and  prostate  and  breast  cancers  (Ali  et  al., 
1999;  Cantley  and  Neel,  1999;  Dahia,  2000;  Di  Cristofano  and  Pandolfi,  2000).  In  addition,  germline  mutations 
in  PTEN  cause  Cowden  syndrome  (CS),  characterized  by  multiple  hamartomas  and  a  high  proclivity  for 
developing  cancer  (Cantley  and  Neel,  1999).  The  5’-UTRs  of  human  and  mouse  PTEN  are  highly  homologous 
(the  0.9  kb  5’-UTR  near  the  translation  start  codon  shows  -90%  homology)  and  have  many  features  of  a 
translationally-regulated  gene.  Firstly,  they  are  significantly  longer  (>0.95  kb)  than  the  average  eukaryotic  5’- 
UTR.  Secondly,  they  contain  multiple  short  open  reading  frames  (>3  for  human  and  mouse  PTEN).  Thirdly, 
they  are  GC  rich  with  predicted  stable  complex  secondary  structures.  These  features  of  5’-UTR  are  expected  to 
significantly  hinder  translation  initiated  by  the  conventional  cap-dependent  mechanism.  Thus,  an  alternative 
mechanism  such  as  internal  ribosome  entry  may  be  involved  in  regulating  the  translation  of  PTEN.  It  is  possible 
that  in  some  advanced  prostate  cancers  PTEN  expression  is  omitted  due  to  miss-regulation  of  IRES-mediated 
translation  initiation.  In  this  project,  we  plan  to  test  this  theory  which  is  novel  and  has  not  been  anticipated 
previously. 

BODY 

In  the  original  application,  we  proposed  to  accomplish  two  tasks  in  the  first  year  as  detailed  below: 

Taskl  (months  1-6):  Cap-dependent  translation  and  IRES  activity  of  PTEN/MMAC1  5’-UTR.  The 

5’-UTR  of  PTEN/MM  AC  1  will  be  cloned  into  pGEM  vector  for  in  vitro  and  into  a  dicistronic  vector  for  in  vivo 
analysis. 

Task  2  (months  6-13):  Boundary  of  PTEN/MMAC1  IRES  and  mutagenesis  effect.  Systematic 
deletion  and  site-directed  mutagenesis  will  be  performed  using  the  dicistronic  construct  to  determine  the  boundary 
of  the  IRES  of  PTEN/MMAC1  5’-UTR  and  to  map  the  important  element. 

In  the  first  year,  we  have  accomplished  testing  the  5’-UTR  sequence  of  PTEN  as  planed  in  the  task  1. 
Analysis  of  the  5’-UTR  using  Zuker’s  mfold  program  (version  3.0)  (Mathews  et  al.,  1999)  showed  that  this 
region  has  potential  to  form  various  secondary  structures  and  all  of  the  structures  predicted  are  extremely  stable 
with  a  typical  free  energy  as  high  as  -432  kcal/mole,  which  is  well  beyond  the  energy  shown  to  inhibit 
ribosome  scanning.  In  addition,  this  region  is  also  punctuated  with  four  translation  start  codon  AUGs  that 
encode  two  short  upstream  open  reading  frames  (uORFs)  with  36  and  45  amino  acids,  respectively  (two  of  the 
AUGs  are  located  at  the  end  of  the  first  uORF  in  the  same  frame).  Both  uORFs  end  before  -55 1  bases  upstream 
of  the  physiological  translation  start  codon.  Generally,  uAUGs  or  uORFs  inhibit  translation  of  the  main  ORF  by 
interfering  the  scanning  process  of  the  40S  ribosome  complex.  To  analyze  the  effect  of  the  5’-UTR  sequence  of 
mouse  PTEN  on  translation,  we  utilized  an  in  vitro  transcription  and  translation  system.  Mouse  PTEN  cDNA 
with  and  without  the  5’-UTR  sequence  were  cloned  into  pGEM-4Z  under  the  control  of  SP6  promoter.  Run-off 
transcripts  were  generated  by  SP6  RNA  polymerase,  purified  and  quantified,  and  equal  amount  of  RNAs  were 
used  to  program  cell-free  translation  in  rabbit  reticulocyte  lysate.  We  found  that  no  full-length  PTEN  protein 
was  translated  from  transcripts  containing  the  long  5’-UTR  whereas  significant  amount  of  PTEN  protein  was 
generated  in  the  absence  of  the  5’-UTR  sequence.  Clearly,  the  presence  of  the  long  5’-UTR  sequence  drastically 
inhibits  the  translation  of  PTEN  in  rabbit  reticulocyte  lysate. 

To  analyze  whether  the  protein-encoding  sequence  (ORF)  influence  the  translational  inhibition  effect  of 
the  5’-UTR,  we  cloned  the  5’-UTR  sequence  of  PTEN  immediately  upstream  of  an  open  reading  frame 
encoding  firefly  luciferase  in  a  pGEM-4Z  based  reporter  vector,  pG-LUC,  resulting  in  the  plasmid  pG-5’LUC. 
This  construct  was  then  used  to  program  in  vitro  transcription  and  cell-free  translation  as  described  above. 
Again,  the  translation  programmed  by  pG-LUC  transcript  generated  significantly  increasing  amount  of  firefly 
luciferase  activity  proportional  to  the  amount  of  transcripts  used  while  no  luciferase  activity  was  detected  from 
the  translation  programmed  by  pG-5’LUC  transcripts.  Thus,  the  5’-UTR  sequence  of  mouse  PTEN  inhibits 
translation  of  transcripts  encoding  a  heterologous  protein  and  is  independent  of  the  protein-encoding  sequence. 


4 


The  5’-UTR  sequence  of  PTEN  has  also  been  cloned  into  the  dicistronic  expression  vector  pRF  and 
tested  for  its  IRES  activity.  The  IRES  sequence  of  human  rhinovirus  (HRV)  was  engineered  in  the  same  way 
and  was  used  as  a  positive  control.  The  pRF-based  constructs  contain  an  SV40  promoter  to  direct  the 
transcription  of  dicistronic  RNA  encoding  Renilla  luciferase  as  the  first  cistron  and  firefly  luciferase  as  the 
second  cistron.  Translation  of  the  first  cistron  ( Renilla  luciferase)  serves  as  an  indicator  of  cap-dependent 
translation  while  translation  of  the  second  cistron  (firefly  luciferase)  reflects  the  IRES  activity-associated  with 
the  inserted  intergenic  sequence.  This  approach  is  considered  as  a  “gold  standard”  for  characterizing  cellular 
IRES  (Sachs,  2000).  These  dicistronic  constructs  were  transfected  into  HeLa  cells  and  both  Renilla  and  firefly 
luciferase  activities  were  measured.  We  found  that  the  5’-UTR  sequence  of  PTEN  stimulated  the  expression  of 
firefly  luciferase  (second  cistron)  by  ~70  fold  over  negative  vector  control  and  about  2-3  fold  over  the  positive 
pR-HRV-F  control.  Similar  results  were  also  obtained  using  another  cell  line  HI 299.  These  results  suggest  that 
the  5’-UTR  sequence  of  PTEN  may  contain  an  IRES  element. 

We  have  also  accomplished  the  systematic  deletion  and  site-directed  mutagenesis  to  determine  the 
boundary  of  the  IRES  of  PTEN/MMAC1  5’-UTR  as  planed  in  task  2.  To  define  the  boundaries  in  the  potential 
IRES  in  the  5’-UTR  sequence  of  PTEN  for  enhancing  expression  of  the  second  cistron,  a  series  of  deletion 
mutants  of  the  5’-UTR  of  PTEN  were  engineered  for  dicistronic  assay.  Deletion  from  both  ends  of  the  5’-UTR 
resulted  in  gradual  decrease  in  stimulating  firefly  luciferase  expression.  Since  the  deletion  mutant  D828-551 
and  D220-1  retains  the  most  enhancing  activity,  the  230-nucleotide  central  region  of  the  5’-UTR  (-551  to  -220) 
may  contain  the  important  elements  for  stimulating  firefly  luciferase  expression. 

However,  it  has  been  argued  that  the  use  of  dicistronic  vector  for  determination  of  IRES  activity  suffers 
an  inevitable  drawback  that  the  5’-UTR  sequence  may  contain  a  cryptic  promoter  (Kozak,  2001;  Schneider  and 
Kozak,  2001).  In  the  past,  this  argument  has  been  essentially  ignored.  To  address  this  potential  problem,  we 
created  a  promoterless  dicistronic  vector  by  deleting  the  SV40  promoter  of  the  vector.  We  argued  that  if  the  5’- 
UTR  sequence  contains  a  promoter  rather  than  an  IRES,  the  second  cistron  would  be  expressed  regardless  with 
or  without  the  SV40  promoter.  To  validate  this  newly  created  vector  system,  we  first  tested  the  well-known 
IRES  of  eIF4G  (Gan  et  al.,  1998;  Gan  and  Rhoads,  1996).  To  our  surprise,  we  found  that  the  known  IRES  of 
eIF4G  is  in  fact  a  promoter.  This  conclusion  was  supported  by  several  other  evidence  such  as  RNA  transfection 
assay  and  mapping  the  promoter  sequence  and  identification  of  transcription  factors  involved.  This  study  has 
recently  been  published  in  Molecular  and  Cellular  Biology  (see  appendix).  We  plan  to  use  this  newly  created 
promoterless  dicistronic  vector  and  RNA  transfection  assays  to  test  the  IRES  activity  in  the  5’-UTR  sequence  of 
PTEN. 

KEY  RESEARCH  ACCOMPLISHMENTS 

1.  Cloned  the  5’-UTR  sequence  of  PTEN  into  the  in  vitro  expression  system  and  determined  its 
inhibitory  effect  on  translation  of  PTEN  and  a  heterologous  reporter  gene  sequence. 

2.  Cloned  the  5’-UTR  sequence  of  PTEN  into  the  traditional  dicistronic  expression  system  and 
determined  its  activity  to  stimulate  the  expression  of  the  second  cistron. 

3.  Created  the  deletion  mutant  and  mapped  the  region  of  the  5’-UTR  sequence  of  PTEN  which  has  the 
activity  to  stimulate  the  expression  of  the  second  cistron  in  the  dicistronic  assay. 

4.  Created  the  new  promoterless  dicistronic  assay  system  for  IRES  assay  and  found  that  the  known 
IRES  published  may  be  misinterpreted  from  cryptic  promoters. 

REPORTABLE  OUTCOMES 

Han,  B.  &  Zhang,  J.T.  (2002).  Regulation  of  gene  expression  by  internal  ribosome  entry  site  (IRES)  or 
cryptic  promoters:  the  eIF4G  story.  Mol  Cell  Biol.  22, 7372-84. 

CONCLUSIONS 

In  conclusion,  the  long  5’-UTR  sequence  of  PTEN  inhibits  cap-dependent  translation.  Thus,  PTEN 
mRNA  with  the  long  5’-UTR  cannot  be  translated  into  protein  under  normal  conditions.  There  is  a  region  in  the 
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5’-UTR  sequence  of  PTEN  that  can  stimulate  the  expression  of  the  second  cistron  in  the  traditional  dicistronic 
assay.  However,  whether  this  region  contains  IRES  needs  to  be  tested  using  the  promoterless  dicistronic  vector 
recently  created  and  RNA  transfection  assays. 
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As  an  alternative  to  the  scanning  mechanism  of  initiation,  the  direct-internal-initiation  mechanism  postu¬ 
lates  that  the  translational  machinery  assembles  at  the  AUG  start  codon  without  traversing  the  entire  5' 
untranslated  region  (5'-UTR)  of  the  mRNA.  Although  the  existence  of  internal  ribosome  entry  sites  (IRESs) 
in  viral  mRNAs  is  considered  to  be  well  established,  the  existence  of  IRESs  in  cellular  mRNAs  has  recently 
been  challenged,  in  part  because  when  testing  is  carried  out  using  a  conventional  dicistronic  vector,  Northern 
blot  analyses  might  not  be  sensitive  enough  to  detect  low  levels  of  monocistronic  transcripts  derived  via  a 
cryptic  promoter  or  splice  site.  To  address  this  concern,  we  created  a  new  promoterless  dicistronic  vector  to  test 
the  putative  IRES  derived  from  the  5'-UTR  of  an  mRNA  that  encodes  the  translation  initiation  factor  eIF4G. 

Our  analysis  of  this  5'-UTR  sequence  unexpectedly  revealed  a  strong  promoter.  The  activity  of  the  internal 
promoter  relies  on  the  integrity  of  a  polypyrimidine  tract  (PPT)  sequence  that  had  been  identified  as  an 
essential  component  of  the  IRES.  The  PPT  sequence  overlaps  with  a  binding  site  for  transcription  factor 
C/EBPp.  Two  other  transcription  factors,  Spl  and  Ets,  were  also  found  to  bind  to  and  mediate  expression  from 
the  promoter  in  the  5'-UTR  of  eIF4G  mRNA.  The  biological  significance  of  the  internal  promoter  in  the  eIF4G 
mRNA  might  lie  in  the  production  of  an  N-terminally  truncated  form  of  the  protein.  Consistent  with  the  idea 
that  the  cryptic  promoter  we  identified  underlies  the  previously  reported  IRES  activity,  we  found  no  evidence 
of  IRES  function  when  a  dicistronic  mRNA  containing  the  eIF4G  sequence  was  translated  in  vitro  or  in  vivo. 

Using  the  promoterless  dicistronic  vector,  we  also  found  promoter  activities  in  the  long  5'-UTRs  of  human  Sno 
and  mouse  Bad  mRNAs  although  monocistronic  transcripts  were  not  detectable  on  Northern  blot  analyses.  The 
promoterless  dicistronic  vector  might  therefore  prove  useful  in  future  studies  to  examine  more  rigorously  the 
claim  that  there  is  IRES  activity  in  cellular  mRNAs. 


Translational  control  plays  an  important  role  in  regulating 
gene  expression  in  eukaryotic  cells  (9,  24,  30,  52,  54,  69,  73). 
Most  of  the  translational  regulation  occurs  at  the  level  of 
initiation,  which  is  usually  the  rate-limiting  step  in  protein 
synthesis  (15,  29,  44,  50,  68).  Initiation  of  translation  in  eu¬ 
karyotes  normally  depends  on  the  m7GpppN  5 '-cap  structure 
of  mRNA,  which  recruits  the  43S  ribosome  preinitiation  com¬ 
plex  via  interaction  with  the  cap  binding  protein  eIF4E  (69). 
The  translational  machinery  then  migrates  downstream  until  it 
meets  the  first  AUG  codon  in  the  optimal  context  (41,  43). 
This  scanning  model  implies  that  the  migration  of  the  43  S 
preinitiation  complex  is  inhibited  by  any  long  and  highly  struc¬ 
tured  5 '-untranslated  regions  (5'-UTRs).  About  90%  of  5'- 
UTRs  of  vertebrate  mRNAs  have  a  length  in  the  range  be¬ 
tween  100  and  300  bases,  which  is  compatible  with  the 
ribosome-scanning  model.  However,  the  remaining  mRNAs 
have  longer  atypical  5'-UTRs  which  may  also  contain  one  or 
more  AUG  codons.  These  mRNAs  may  not  be  efficiently 
translated  using  the  cap-dependent  initiation  and  ribosome¬ 
scanning  mechanism  (73).  Furthermore,  cap-dependent  trans¬ 
lation  initiation  is  compromised  under  many  cellular  condi¬ 
tions  including  mitosis  (5),  apoptosis  (10,  49),  cellular  stresses 
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such  as  hypoxia  and  heat  shock  (55,  60,  64),  and  viral  infection 
(45,  53,  56). 

Under  the  aforementioned  cellular  conditions,  many  pro¬ 
teins  still  need  to  be  synthesized  possibly  via  the  internal  ribo¬ 
some  entry  site  (IRES)-mediated  initiation  mechanism  (36).  In 
this  mechanism,  the  ribosome  preinitiation  complex  is  re¬ 
cruited  directly  to  the  IRES  element  independent  of  the  5'-cap 
structure  and  begins  to  scan  within  the  vicinity  of  the  initiator 
AUG  codon  by  effectively  avoiding  a  large  part  of  the  5'-UTR 
sequence.  The  IRES-mediated  initiation  was  first  demon¬ 
strated  a  decade  ago  in  picornaviruses  (3, 19,  27,  71).  Picoma- 
virus  mRNAs  are  naturally  uncapped  at  their  5'  ends.  Their 
5'-UTRs  usually  have  significant  secondary  structure  spanning 
approximately  over  500  bases  and  are  punctuated  by  multiple 
AUG  codons.  Such  a  structure  is  predicted  to  inhibit  the  ribo¬ 
some  recruitment  and  linear  ribosome  scanning.  However, 
these  5'-UTRs  can  effectively  confer  a  cap-independent  trans¬ 
lation  by  directly  recruiting  ribosomes. 

Recently,  it  has  been  suggested  that  some  cellular  mRNAs 
may  also  use  the  IRES-mediated  initiation  mechanism  (27,  61, 
71).  Indeed,  over  40  mRNAs  have  been  reported  to  use  IRESs 
to  direct  translation  initiation,  and  the  list  is  growing  rapidly. 
An  IRES  database  (http://www.rangueil.inserm.fr/IRESdatabase) 
has  been  created  to  reflect  their  potential  importance  and  to 
categorize  the  cellular  IRES  elements.  However,  the  existence 
of  such  cellular  IRESs  has  been  questioned  recently  due  to  the 
inevitable  drawbacks  of  the  conventional  dicistronic  test  and 
the  fact  that  dicistronic  mRNA  assays  do  not  work  for  the 
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known  cellular  IRESs  (42).  This  has  generated  debate  over  the 
existence  of  cellular  IRESs  and  how  studies  should  be  con¬ 
ducted  on  future  cellular  IRES  candidates  (63).  In  most  of  the 
previous  cellular  IRES  studies,  an  artificial  dicistronic  con¬ 
struct  was  used  in  which  a  candidate  IRES  element  is  inserted 
between  the  two  cistrons.  The  upstream  cistron  acts  as  an 
indicator  of  cap-dependent  translation,  and  the  downstream 
cistron  indicates  the  IRES  activity.  This  approach  is  considered 
a  “gold  standard”  for  characterizing  cellular  IRESs  (61).  How¬ 
ever,  it  is  acknowledged  by  both  sides  of  the  debate  that  one  of 
the  major  questions  is  whether  cryptic  promoter  or  potential 
splice  sites  exist  in  the  “cellular  IRES”  sequence  of  the  dicis¬ 
tronic  construct  that  may  generate  monocistronic  mRNA  of 
the  second  cistron.  Although  RNA  analyses  such  as  Northern 
blotting  were  conducted  to  rule  out  the  existence  of  monocis¬ 
tronic  mRNA  in  most  previous  studies.  Kozak  (42,  63)  argued 
that  the  RNA  analysis  approaches  used  in  these  studies  were 
not  sensitive  enough  to  rule  out  the  existence  of  monocistronic 
mRNA  present  at  5%  or  less  of  the  dicistronic  mRNA  level. 
Thus,  some  of  the  previously  reported  cellular  IRESs  might  be 
falsely  identified,  and  it  is  clear  that  relevant  rigorous  experi¬ 
mental  methods  are  needed  to  clarify  these  disputes. 

In  this  study,  we  created  a  new  promoterless  dicistronic 
vector  to  rule  out  the  existence  of  cryptic  promoters  in  any 
potential  cellular  IRES  elements.  This  analysis  also  relies  on 
enzyme  assays  in  the  same  way  as  the  conventional  dicistronic 
test  for  IRES  activity  and  therefore  is  more  compatible  with 
IRES  study  than  any  existing  RNA  analysis  approaches.  Using 
this  vector,  we  first  tested  the  5'-UTR  of  eukaryotic  translation 
initiation  factor  4GI  (eIF4G),  the  largest  subunit  of  eIF4F. 
eIF4G  is  important  for  translational  control  by  acting  as  a 
molecular  scaffold  and  coordinating  the  activities  of  eIF3, 
eIF4A,  eIF4E,  and  poly(a)-binding  protein  (PABP)  (22,  28, 
39).  The  translation  of  eIF4G  is  cap  independent  under  con¬ 
ditions  of  viral  infection  (37)  and  involves  an  IRES-dependent 
mechanism  (20,  21,  37).  The  5'-UTR  of  eIF4G  has  been  func¬ 
tionally  characterized  and  shown  to  have  a  polypyrimidine 
tract  (PPT)  region  that  is  essential  for  IRES  activity  (20,  21). 

Using  the  novel  promoterless  dicistronic  vector,  we  demon¬ 
strated  that  a  strong  promoter  activity  exists  in  the  5'-UTR 
sequence  of  eIF4G  that  can  explain  the  enhanced  expression 
of  the  second  cistron  observed  in  the  conventional  dicistronic 
tests.  Further  mapping  of  the  promoter  sequence  in  the  5'- 
UTR  of  eIF4G  showed  that  the  PPT  sequence  was  absolutely 
required  for  the  promoter  activity.  Using  gel  shift  and  super¬ 
shift  analysis,  we  showed  that  transcription  factor  C/EBPp 
binds  to  this  PPT  sequence.  In  addition,  the  5'-UTR  of  eIF4G 
could  not  direct  IRES-dependent  translation  in  rabbit  reticu¬ 
locyte  lysate  (RRL)  or  in  HeLa  cells  when  dicistronic  RNA 
transcripts  were  analyzed.  Thus,  the  cryptic  promoter  in  the 
5'-UTR  of  eIF4G  underlies  the  enhanced  expression  of  the 
second  cistron  observed  in  the  conventional  dicistronic  test. 
Using  the  same  promoterless  dicistronic  vector,  we  also  ob¬ 
served  promoter  activities  in  the  5'-UTR  sequences  of  human 
Sno  and  mouse  Bad,  although  no  monocistronic  transcripts 
were  detected  by  northern  blot.  Hence,  cryptic  promoter  ac¬ 
tivities  in  the  5'-UTR  sequences  of  cellular  mRNAs  are  more 
prevalent  than  anticipated,  and  they  can  be  misinterpreted  to 
be  IRES  sequences  in  the  conventional  dicistronic  analysis  due 
to  the  low  intrinsic  sensitivity  of  Northern  blot  analysis  for 


ruling  out  the  existence  of  monocistronic  transcripts.  We  pro¬ 
pose  that  the  promoterless  dicistronic  vector  may  be  used  to 
establish  claims  of  cellular  IRES  in  future  studies. 

MATERIALS  AND  METHODS 

Materials.  Restriction  enzymes,  m7GpppG  cap  analogue,  Pfu  polymerase,  and 
SL2  cell  line  were  purchased  from  New  England  Biolabs.  AmershamfPharmaeia 
Biotech,  Stratagcnc,  and  the  American  Type  Culture  Collection,  respectively.  T7 
and  Sp6  RNA  polymerases.  RNasin,  RNase-free  DNase,  rabbit  reticulocyte 
lysate,  Luciferasc  assay  ‘Stop  &  Glo’  kit.  pGL3-Promoter  plasmid,  and  pSP64 
poly(A)  plasmid  were  from  Promega.  RNeasy  Mini  Kit  and  Oligotex  mRNA 
Mini  Kit  were  from  Oiagen.  Rediprime  II  Random  Prime  Labeling  System  and 
p-pj-dCTP  were  from  Amersham  Bioscienccs.  The  Sephadex  G-25  Quick  Spin 
Columns  (TE)  for  radiolabeled  DNA  purification  were  from  Roche.  MAGNA 
nylon  transfer  membrane  was  from  Osmonics  Inc.  Wild-type  and  mutant 
C/EBPp  oligonucleotides  used  in  the  electrophoretic  mohility  shift  assay 
(EMSA)  and  antibodies  to  nuclear  proteins  Spl.  Sp3.  and  C/EBPp  were  ob¬ 
tained  from  Santa  Cruz  Biotechnology.  Schneider’s  Drosophila  culture  medium. 
Lipofectamine  Plus,  and  Lipofectin  transfection  reagent  were  purchased  from 
Invitrogen.  Oligonucleotides  were  synthesized  by  Sigma-Genosys.  I.M.A.G.E. 
expressed  sequence  tag  EST  clones  were  obtained  from  either  the  American 
Type  Culture  Collection  or  Research  Genetics. 

Construction  of  plasmids.  To  engineer  dicistronic  constructs  containing  the 
IRES  element  of  human  rhinovirus  (HRV)  or  the  S’-UTR  of  eIF4G,  the  dicis¬ 
tronic  vector  pRF  (70)  was  used.  Plasmid  pGL2/CAT/4G/LUC  containing  the 
5'-UTR  of  eIF4G  (20)  was  used  as  template  for  PCR  to  amplify’  the  5’-UTR  of 
eIF4G  by  using  the  following  two  primers  containing  Spe I  and  Ncol  sites:  5'-C 
AAACTAGTTCTAGATGGGGGTCCT-3'  and  5'-CAACCATGGTGATATC 
CTTTCCTCC-3'.  The  PCR  product  was  cloned  into  the  tyel  and  Ncol  sites  of 
pRF  vector  to  obtain  plasmid  pR-eIF4G-F  (see  Fig.  1A).  The  IRES  sequence  of 
HRV  in  pGL3-R-HRV  plasmid  (70)  was  released  by  digestion  with  Spe  I  and 
Ncol  and  cloned  into  pRF,  resulting  in  pR-HRV-F. 

To  engineer  promoterless  dicistronic  constructs  that  allow  analysis  of  pro¬ 
moter  activity  of  the  DNA  insert  in  the  intergenic  region,  the  simian  virus  40 
(SV40)  promoter  sequence  including  the  chimeric  intron  between  Smal  and  Eco 
RV  was  removed  from  pRF,  pR-eIF4G-F,  and  pR-HRV-F  (see  Fig.  1A),  result¬ 
ing  in  pRF  (-P),  pR-eIF4G-F(-P),  and  pR-HRV-F(-P),  respectively.  For  all 
the  promoter  analysis  studies,  the  promoterless  dicistronic  constructs  were  used. 
In  some  studies,  the  SV40  enhancer  was  deleted  by  cloning  the  Nhel-Hpal 
fragment  from  the  promotcrless  dicistronic  plasmid  pR-eIF4G-F(-P)  into  the 
pGL3-promotcr  vector  (Promega).  which  contains  an  SV40  poly  (A)  signal  but 
does  not  contain  the  SV40  enhancer.  The  resulting  construct  is  named  pR- 
elF4G-F(-PE).  Systematic  deletions  of  the  5’-UTR  sequence  of  eIF4G  were 
generated  by  PCR.  The  elF4G  linker-scanning  constructs  were  made  by  the 
PCR-based  overlap  extension  technique,  which  is  similar  to  the  PCR-based 
site-directed  mutagenesis  method  described  previously  (25).  A  IQ-base  linker 
with  the  sequence  ACTCTAGACT  was  used  to  replace  wild-type  sequences. 

To  generate  poly(A)-tailed  in  vitro  transcripts  for  RNA  transfection  study, 
constructs  containing  poly(A)  were  engineered  using  the  vector  pSP64  Poh’A 
(Promega),  which  has  a  30-bp  (dA-dT)  sequence.  The  EcaKN-Xbal  fragment  of 
pRF  vector  that  contains  the  Renilla  luciferase  gene  was  first  cloned  into  the 
pSP64  PolyA  vector  at  the  Xbal  and  blunted  Mndlll  sites  to  generate  plasmid 
pSP-R,v„,.  The  Xhal  fragment  of  pR-HRV-F  that  contains  the  IRES  of  HRV 
and  the  firefly  luciferasc  gene  were  then  isolated  and  cloned  into  pSP-RA1n  at  the 
Xhal  site  to  generate  pSP-R-HRV-FA10.  The  pSP-RFAW  plasmid  was  obtained 
by  removing  the  IRES  sequence  of  HRV  from  pSP-R-HRV-F,vw  by  digestion 
with  Spcl  and  Ncol.  To  engineer  p SP - R - e  1  Ft G - FA the  5’-UTR  sequence 
(from  —368  to  -44)  of  eIF4G  that  has  been  reported  to  contain  full  IRES 
activity  (20)  was  amplified  by  PCR  using  the  following  primers  with  Spel  and 
Ncol  sites:  5'-CAAACT AGTCTAGATGGGGGTCCT-3'  and  5‘-ACACCATG 
GATTCGGATCTGGGGA-3'.  The  PCR  products  were  used  to  replace  the 
IRES  sequence  of  HRV  in  pSP-R-HRV-FA,„. 

To  clone  the  5'-UTRs  of  mouse  Bad.  human  Sno.  and  human  inhibitor  apo¬ 
ptosis  1  (h!AP-l,  referred  to  hereafter  as  HIAP)  in  both  pRF  and  pRF(-P) 
vectors.  I.M.A.G.E.  EST  clones  were  used  as  templates.  The  405-bp  5'-UTR 
sequence  of  mouse  Bad  (accession  no.  NM_007522)  was  amplified  using  mouse 
I.M.A.G.E.  clone  948554  (accession  no.  AA544696)  with  the  primers  5’-GACT 
AGTCGCACACCTATCCTGGCA-3'  and  5  -GCCATGGTTGGATCCTGGA 
GGCCTG-3'.  The  709-bp  5‘-UTR  sequence  of  human  Sno  (accession  no: 
U70730)  was  assembled  using  I.M.A.G.E.  clones  1713010  (accession  no. 
All 29968)  and  211285  (accession  no.  H69011),  and  finally  amplified  by  the 
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FIG.  1.  Stimulation  of  the  second-cistron  expression  by  the  5'-UTR  sequence  of  eIF4G.  (A)  Schematic  diagram  of  dicistronic  constructs 
without  insert  (pRF),  or  with  the  IRES  of  HRV  (pR-HRV-F)  and  the  5'-UTR  of  eIF4G  (pR-eIF4G-F)  in  the  intergenic  region.  The  locations  of 
several  relevant  restriction  enzyme  sites  are  shown  by  arrows.  (B)  Relative  luciferase  activity  generated  by  the  dicistronic  constructs.  HeLa  cells 
were  transfected  with  pRF,  pR-HRV-F,  and  pR-eIF4G-F  constructs.  At  24  h  following  transfection,  the  cells  were  harvested,  the  Renilla  and  firefly 
luciferase  (R.  Luc.  and  F.  Luc.)  activities  were  measured,  and  the  relative  ratios  were  calculated  and  normalized  to  that  of  the  vector-transfected 
cells  (RF).  The  data  are  from  six  independent  experiments. 


following  two  primers:  5'-AACTAGTGGTTTCAAATTGGCCCT-3'  and  5'-G 
GTTTTCCATGGTACACTCT-3 ' .  The  1.3-kb  5'-UTR  sequence  of  human 
HIAP  (accession  no.  AF070674)  was  assembled  using  human  I.M.A.G.E.  clones 
2349482  (accession  no.  AI827488)  and  428231  (accession  no.AA002125)  by  a 
combination  of  PCR  and  restriction  digestion.  All  of  the  5'-UTR  cDNA  frag¬ 
ments  were  finally  cloned  between  the  Spe I  and  Ncol  sites  of  the  pRF  and 
pRF(-P)  vectors,  resulting  in  pR-Bad-F,  pR-Sno-F,  pR-HIAP-F,  pR-Bad- 
F(— P)  and  pR-Sno-F(-P).  All  the  above  constructs  were  confirmed  by  DNA 
sequencing. 

Cell  culture  and  DNA  and  RNA  transfection.  HeLa  and  H1299  cells  were 
maintained  at  37°C  under  5%  C02  in  Dulbecco  modified  Eagle  medium  and 
RPMI  1640  medium,  respectively,  supplemented  with  10%  fetal  bovine  serum. 
Schneider’s  Drosophila,  cell  line  2  (SL2)  was  maintained  at  room  temperature 
under  atmospheric  CO  2  in  Schneider’s  Drosophila  medium  supplemented  with 
10%  fetal  bovine  serum. 

DNA  transfection  in  both  HeLa  and  H1299  cells  was  performed  with  Lipo- 
fectamine  Plus  reagents  as  specified  by  the  manufacturer.  In  a  24-well  plate, 
approximately  105  cells/well  were  plated  and  were  transfected  with  0.4  pg  of 
DNA  the  next  day.  Cells  were  harvested  for  the  luciferase  assay  24  h  following 
transfection.  SL2  cells  were  transfected  using  the  calcium  phosphate  method  as 
previously  described  (25).  Briefly,  4  pg  of  pR-eIF4G-F(-P)  or  pR-eIF4G- 
F(-PE)  was  cotransfected  with  2  pg  of  Drosophila  expression  plasmids  for  Spl, 
Ets-1  (38),  or  combination  of  Spl  plus  Ets-1.  Equal  amounts  of  DNA  were 
obtained  in  each  transfection  by  adding  empty  vector  PacO.  Cells  were  incubated 
with  calcium  phosphate-precipitated  DNA  for  48  h  and  then  harvested  for  the 
luciferase  assay. 

RNA  transfection  was  performed  using  the  cationic  liposome-mediated 
method  as  described  by  Dwarki  et  al  (16).  Briefly,  approximately  2  X  105  cells/ 
well  were  seeded  onto  six-well  plates  on  the  day  before  transfection.  Opti-MEM 
I  medium  (1  ml)  in  a  12-by  75-mm  polystyrene  snap-cap  tube  was  mixed  with  12.5 
pg  of  Lipofectin  reagent  and  5  pg  of  capped  mRNA.  The  liposome-RNA- 
medium  mixture  was  immediately  added  to  cells.  At  8  h  following  transfection, 
the  cells  were  harvested  and  processed  for  luciferase  analysis. 

Preparation  of  cytoplasmic  extract  (S100).  Cytoplasmic  extract  (S100)  was 
prepared  as  previously  described  (31).  Briefly,  HeLa  cells  were  collected  using 
rubber  policeman,  washed  with  phosphate-buffered  saline  and  then  resuspended 
at  a  density  of  5  X  107  cells/ml  in  homogenization  buffer  H100  (10  mM  Tris-HCl 
[pH  7.4],  1.5  mM  MgCl2,  10  mM  KC1,  0.5  mM  dithiothreitol,  0.1  mM  phenyl- 
methylsulfonyl  fluoride,  10  pg  of  leupeptin  per  ml).  The  cells  were  homogenized 
with  a  Dounce  homogenizer  for  20  strokes  on  ice.  Cell  nuclei  were  pelleted  by 
centrifugation  at  2,000  X  g  for  10  min.  The  supernatant  was  then  adjusted  to  150 
mM  KC1  and  centrifuged  at  100,000  X  g  at  4°C  for  90  min.  The  supernatant  was 
recovered  as  S100  extract.  The  extract  was  flash  frozen  with  5%  glycerol  and 
stored  at  —  70°C. 

In  vitro  transcription  and  translation.  In  vitro  transcription  and  translation 
were  performed  as  previously  described  (75).  DNA  templates  with  a  poly(A)  tail 
were  linearized  with  EcoRl,  while  DNA  templates  without  a  poly(A)  tail  were 
linearized  with  BamHl.  Capped  transcripts  were  synthesized  in  the  presence  of 
1  mM  m7GpppG  and  purified  using  the  Qiagen  RNeasy  mini  kit.  A  50-ng  portion 


of  capped  RNA  transcripts  was  used  to  program  cell-free  translation  in  rabbit 
reticulocyte  lysate  (RRL)  in  a  final  volume  of  10  pi  containing  6,5  pi  of  RRL. 
The  translation  mixture  contained  2  pi  of  either  H100  buffer  or  cytoplasmic 
extract  S100  (8.0  pg/pl). 

Nuclear  extract  preparation,  EMSA,  and  UV  cross-linking.  HeLa  nuclear 
extract  preparation,  EMSA,  and  supershift  analysis  were  done  as  previously 
described  (25).  The  two  strands  of  oligonucleotides  used  as  EMSA  probes  were 
annealed  prior  to  labeling.  The  sequences  of  the  sense  strand  of  the  oligonucle¬ 
otide  probes  are  5'-TAGCnTCTTTCCCCAGATCC-3'  (eIF4G  C/EBPp  [-68 
to  -48]),  5 ' -GAGGTGGGCTCTTCCTGCTTCC-3 '  (eIF4G  Ets-1  [-101  to 
-80]),  and  5'-GCTGGGGGGTGGGGAGTTGG-3'  (eIF4G  Spl  [-151  to 
-132]).  The  oligonucleotides  used  for  competition  analysis  are  wild-type  MRG1 
Spl  (5 ' -TTAAGCTTCGCTCCGCCCTTCC-3 ' ),  mutant  MRG1  Spl  (5'-TTAA 
GCTTCGCTITGCCCTTCC-3')  (25),  wild-type  C/EBPp  (5'-TGCAGATTGC 
GCAATGTGCA-3'),  and  mutant  C/LBPfl  (5 ' -TGCAGAGACTAGTCTCTGC 
A-3').  For  UV  cross-linking,  the  reaction  mixture  containing  probe  and  nuclear 
extract  was  incubated  at  room  temperature  for  20  min  and  then  irradiated  with 
UV  at  254  nm  at  a  distance  of  5  cm  for  30  min  by  using  a  UV  Stratalinker  1800 
(Stratagene).  The  mixture  was  then  separated  by  sodium  dodecyl  sulfate-poly- 
acrylamide  gel  electrophoresis  (SDS-PAGE)  (10%  polyacrylaide). 

Northern  blot  analysis.  Subconfluent  H1299  cells  in  10-cm  plates  were  trans¬ 
fected  with  constructs  (4  pg/plate)  using  Lipofectamine  Plus.  The  total  RNAs 
were  extracted  using  an  RNeasy  mini  kit  48  h  following  transfection.  Residual 
plasmid  DNA  in  the  total  RNA  was  digested  with  RNase-free  DNase.  The 
poly(A)  RNAs  were  then  isolated  from  250  pg  of  total  RNAs  using  the  Oligotex 
mRNA  mini  kit.  One-fifth  of  the  mRNAs  were  separated  in  1%  agarose  gels  in 
the  presence  of  formaldehyde  and  morpholinepropanesulfonic  acid  (MOPS) 
buffer  and  blotted  onto  MAGNA  nylon  membranes.  The  blots  were  hybridized 
with  a  32P-labeled  firefly  luciferase  DNA  probe  (1,656  bp),  which  was  isolated 
from  pRF  by  cleaving  with  Ncol  and  Xbal  and  labeled  using  Rediprime  II 
random-prime  labeling  system. 

RESULTS 

The  5'-UTR  of  eIF4G  directs  expression  of  the  second  cis- 
tron  in  a  dicistronic  construct.  To  further  analyze  the  IRES  of 
eIF4G,  we  reengineered  the  dicistronic  plasmids  by  cloning  the 
cDNA  encoding  the  5'-UTR  of  eIF4G  into  a  widely  used 
dicistronic  vector,  pRF  (70)  (see  also  Fig.  1A).  This  vector 
contains  the  SV40  promoter  to  direct  the  transcription  of  di¬ 
cistronic  RNA  encoding  Renilla  luciferase  as  the  first  cistron 
and  firefly  luciferase  as  the  second  cistron.  Following  the  SV40 
poly(A)  signal,  there  is  an  SV40  enhancer  that  enhances  the 
promoter  activity.  The  IRES  sequence  of  HRV  was  engineered 
in  the  same  way  as  eIF4G  and  was  used  as  a  control  (Fig.  1A). 
These  plasmids  were  transfected  into  HeLa  cells,  and  both 
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FIG.  2.  Translation  of  dicistronic  mRNA  in  HeLa  cells  and  in  RRL.  (A)  Schematic  diagram  of  the  dicislronic  mRNA  used  for  translation  in 
HeLa  cells.  In  vitro  transcripts  with  5'-cap  (m  7G)  and  3'-poly(A)  tail  (A,,,)  were  synthesized  using  T7  RNA  polymerase  from  linearized  vector 
alone  (RF^q),  constructs  containing  the  IRES  of  HRV  (R-IIRV-FAW),  and  the  5'-UTR  of  eIF4G  (R-eIF4G-F/un).  R-,  Renilla;  F.,  firefly.  (B) 
Relative  luciferase  activity  from  dicistronic  mRNAs  in  HeLa  cells.  HeLa  cells  were  transfected  with  the  dicistronic  mRNAs,  and  8  h  following 
transfection,  Renilla  and  firefly  luciferase  (R.  Luc.  and  F.  Luc.)  activities  were  measured  and  the  relative  ratios  were  calculated  and  normalized 
to  that  of  the  vector-transfected  cells  (RF).  (C)  Relative  luciferase  activity  from  dicistronic  RNA  in  RRL.  Capped  dicistronic  transcripts  were 
synthesized  using  T7  RNA  polymerase  from  linearized  pRF,  pR-HRV-F,  and  pR-eIF4G-F  (Fig.  1).  The  in  vitro  transcripts  were  then  used  to 
program  translation  in  RRL  in  the  presence  of  buffer  alone  (H100)  or  the  HeLa  cytoplasmic  extract  (S100).  Following  the  cessation  of  translation, 
Renilla  and  firefly  luciferase  activities  were  measured,  the  ratio  of  firefly  to  Renilla  activity  was  determined,  and  the  relative  ratios  were  calculated 
and  normalized  to  the  vector  control  in  the  presence  of  buffer  (H100).  The  data  are  representative  of  three  independent  experiments.  The  gels 
shown  at  the  bottom  are  in  vitro  transcripts  (500  ng  each)  separated  on  a  1%  agarose  gel  that  were  used  for  in  vitro  and  in  vivo  studies,  respectively. 


Renilla  and  firefly  luciferase  activities  were  measured.  As 
shown  in  Fig.  IB,  the  5'-UTR  of  eIF4G  displays  an  unusually 
high  activity  in  directing  the  expression  of  firefly  luciferase. 
The  activity  is  ~850-fold  higher  than  that  of  the  control  vector 
pRF  and  ~40-fold  higher  than  that  of  the  HRV  IRES.  Similar 
results  were  observed  after  transfecting  other  cell  lines  such  as 
HI 299  cells  (data  not  shown).  This  observation  is  consistent 
with  previous  studies  which  suggested  that  the  stimulation  of 
the  second  cistron  expression  was  due  to  IRES  activities  in  the 
5'-UTR  of  eIF4G  (20,  21). 

Translation  of  dicistronic  RNA  transcripts  in  HeLa  cells 
and  RRL.  The  above  results  in  Fig.  1  suggest  that  (i)  the 
5'-UTR  of  eIF4G  contains  IRES  activity  which  enhances  the 
translation  of  firefly  luciferase  from  the  dicistronic  mRNA  by 
internal  initiation,  as  suggested  previously  (20,  21);  (ii)  the 
5'-UTR  of  eIF4G  may  contain  a  cryptic  promoter  which  di¬ 
rects  transcription  of  the  firefly  luciferase  gene;  and/or  (iii)  the 
5'-UTR  of  eIF4G  contains  a  cryptic  splicing  acceptor  site 
which  creates  a  splicing  variant  with  only  the  second  cistron  of 
the  firefly  luciferase  gene.  To  distinguish  between  these  possi¬ 
bilities,  we  first  generated  dicistronic  RNAs  in  vitro  from  the 
dicistronic  constructs  and  used  them  to  program  translation 
both  in  HeLa  cells  and  in  RRL.  RNA  transfection  is  one  of  the 


major  methods  for  characterizing  translation  efficiency  and 
identifying  eukaryotic  regulatory  factors  influencing  IRES  ac¬ 
tivity.  This  method  bypasses  the  complex  issue  of  transcrip¬ 
tional  regulation  and  requires  only  the  cytoplasmic  delivery  of 
the  transcripts.  For  purposes  of  RNA  transfection,  pSP- 
RFam,  pSP-R-eIF4G-FA30,  and  pSP-R-HRV-FA3n  were  engi¬ 
neered  and  used  for  producing  dicistronic  transcripts  contain¬ 
ing  the  m  7GpppG  cap  and  polyadenylated  tail  in  vitro  (Fig. 
2 A).  Transcripts  were  introduced  into  HeLa  cells  by  Lipofectin 
encapsulation  (16).  At  8  h  following  transfection,  cell  lysates 
were  prepared  for  luciferase  activity  measurement.  As  ex¬ 
pected,  the  firefly  luciferase  of  the  vector  RNA  was  very  poorly 
translated  and  its  activity  (arbitrary  units)  represented  only 
about  0.16%  of  that  of  Renilla  luciferase  (data  not  shown).  It 
increased  to  about  6.7%  with  the  dicistronic  RNAs  containing 
HRV  IRES.  Therefore,  HRV  IRES  significantly  stimulated 
the  translation  of  firefly  luciferase,  about  40-fold  higher  than 
that  of  the  vector  control  (Fig.  2B).  However,  no  stimulation  of 
firefly  luciferase  expression  was  observed  with  the  5'-UTR  of 
eIF4G.  We  next  used  the  dicistronic  RNA  to  program  trans¬ 
lation  in  RRL.  As  shown  in  Fig.  2C,  the  IRES  of  HRV  did  not 
show  any  enhancement  of  firefly  luciferase  expression  in  RRL 
(solid  bar),  indicating  that  RRL  may  lack  factors  necessary  for 
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FIG.  3.  Cryptic  promoter  activity  of  the  5'-UTR  of  eIF4G.  (A) 
Schematic  diagram  of  promoterless  dicistronic  construct  of  pRF(-P), 
pR-HRV-F(-P)  and  pR-eIF4G-F(-P).  The  sequences  of  the  SV40 
promoter  and  chimeric  intron  were  removed  from  the  parental  dicis¬ 
tronic  constructs  shown  in  Fig.  1A.  (B)  Relative  luciferase  activity 
generated  from  the  promoterless  constructs.  The  promoterless  con¬ 
structs  were  transfected  into  HeLa  cells,  and  24  h  following  the  trans¬ 
fection,  cells  were  harvested  for  determination  of  Renilla  and  firefly 
luciferase  (R.  Luc.  and  F.  Luc.)  activity.  The  relative  ratios  between 
firefly  and  Renilla  luciferase  activities  were  calculated  and  normalized 
to  that  of  the  vector-transfected  cells  [pRF(-P)].  The  data  were  from 
six  independent  experiments  performed  in  duplicate. 


FIG.  4.  Effects  of  deletions  on  the  promoter  activity  of  the  5'-UTR 
of  eIF4G.  (A)  Schematic  diagram  of  the  deletions  in  the  5'-UTR  of 
eIF4G.  The  positions  of  the  5'  and  3'  ends  of  each  deletion  are 
indicated  on  the  left  and  right,  respectively.  These  mutant  5'-UTRs 
were  engineered  into  the  promoterless  dicistronic  vector  at  the  inter- 
genic  region.  (B)  Relative  luciferase  activity  generated  from  the  wild- 
type  and  mutant  5'-UTR  sequences  of  eIF4G.  HeLa  cells  were  trans¬ 
fected  with  the  constructs  shown  in  panel  A,  and  24  h  following 
transfection,  Renilla  and  firefly  luciferase  (R.  Luc.  and  F.  Luc.)  activ¬ 
ities  were  measured  and  the  ratio  of  firefly  to  Renilla  luciferase  was 
calculated  and  normalized  to  that  of  the  wild  type  (WT)  control. 


IRES-dependent  translation  initiation.  The  addition  of  HeLa 
S100  cytoplasmic  extract  to  RRL  stimulated  ~40-fold  expres¬ 
sion  of  firefly  luciferase  under  the  control  of  IRES  of  HRV 
(Fig.  2C).  However,  no  stimulation  of  firefly  luciferase  expres¬ 
sion  was  observed  by  the  5'-UTR  of  eIF4G  in  either  the  ab¬ 
sence  or  presence  of  HeLa  S100  extract  (Fig.  2C),  confirming 
the  results  of  RNA  transfection  studies.  Thus,  the  5'-UTR  of 
eIF4G  may  not  contain  an  IRES  element  to  direct  cap-inde¬ 
pendent  translation  from  a  dicistronic  RNA  transcript. 

Cryptic  promoter  activity  in  the  5'-UTR  of  eIF4G.  To  ana¬ 
lyze  whether  the  5'-UTR  of  eIF4G  contains  any  promoter 
activity,  we  simply  removed  the  unique  SV40  promoter  to¬ 
gether  with  the  intron  sequence  from  the  dicistronic  con¬ 
structs.  These  promoterless  dicistronic  constructs  (Fig.  3A) 
were  then  transfected  into  HeLa  cells  for  determination  of 
both  Renilla  and  firefly  luciferase  activities  (Fig.  3B).  As  ex¬ 
pected,  both  the  Renilla  and  firefly  luciferase  activities  were 
minimal  but  could  be  detected  for  the  pRF(-P)  vector  control. 
Only  a  twofold  increase  in  firefly  luciferase  activity  was  ob¬ 
served  with  the  pR-HRV-F(-P)  construct.  This  small  increase 
is  in  dramatic  contrast  to  the  20-fold  increase  when  the  pR- 
HRV-F  construct  was  used,  as  shown  in  Fig.  IB.  Thus,  the 
enhanced  expression  of  firefly  luciferase  from  pR-HRV-F  con¬ 
structs  (Fig.  1)  was  not  due  to  production  of  monocistronic 


transcript  by  a  cryptic  promoter  in  the  HRV  sequence.  Sur¬ 
prisingly,  the  pR-eIF4G-F(-P)  construct  generated  more  than 
900-fold-higher  firefly  luciferase  activity  than  that  of  the  vector 
control.  This  increase  was  similar  to  that  generated  by  the 
pR-eIF4G-F  construct  shown  in  Fig.  IB,  suggesting  that  the 
significant  enhancement  in  the  firefly  luciferase  expression  by 
the  5'-UTR  sequence  of  eIF4G  is  probably  due  to  the  presence 
of  a  strong  promoter  in  this  region. 

Mapping  of  the  eIF4G  promoter  in  the  5'-UTR.  The  above 
studies  suggest  that  the  5'-UTR  sequence  of  eIF4G  contain  a 
cryptic  promoter  which  may  have  been  thought  to  be  an  IRES 
element  in  previous  studies.  To  further  validate  our  study,  we 
mapped  the  cryptic  promoter  by  deleting  DNA  sequences  sys¬ 
tematically  from  both  the  5'  and  3'  ends  of  the  5'-UTR  of 
eIF4G  in  the  promoterless  dicistronic  vector  (Fig.  4A).  As 
shown  in  Fig  4B,  the  promoter  activity  increased  significantly 
when  50  bases  was  deleted  from  the  5'  end  of  the  5'-UTR 
(D368-318),  indicating  that  there  is  a  repressor  element  within 
this  region.  The  promoter  activity  did  not  significantly  change 
after  further  deletion  up  to  200  bases  from  the  5'  end  (D368- 
168).  However,  deletion  of  an  additional  50  bases  from  the  5' 
end  resulted  in  a  dramatic  decrease  in  promoter  activity 
(D368-118),  suggesting  that  the  168  bases  upstream  of  the 
translation  start  site  contains  the  full  promoter  activity  of  the 
5'-UTR  of  eIF4G.  Furthermore,  deletion  of  the  region  from 
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FIG.  5.  Mapping  of  sequences  important  for  the  promoter  activity  in  the  5'-UTR  of  eIF4G.  (A)  Schematic  demonstration  of  the  linker-scanning 
mutations.  Linker-scanning  mutations  were  made  onto  the  full-length  or  truncated  D168  constructs.  The  putative  transcription  factor  binding  sites 
predicted  by  Matlnspector  software  are  shown  above  the  sequence.  The  dashed  line  indicates  the  polypyrimidine  tract  region  important  for  IRES 
activity.  WT,  wild  type.  (B  and  C)  Relative  luciferase  activity  of  wild-type  and  mutant  full-length  (B)  or  D368-168  (C)  constructs  in  HeLa  cells. 
At  24  h  following  transfection  of  the  constructs  into  HeLa  cells,  the  cells  were  harvested  for  determination  of  Rcnilla  and  firefly  luciferase  activity. 
The  ratio  of  firefly  to  Renilla  luciferase  activity  was  calculated  and  normalized  to  that  of  the  wild-type  sequences.  Data  are  from  three  independent 
experiments. 


—49  to  —1  with  respect  to  the  3'  end  did  not  significantly  affect 
promoter  activity,  whereas  deletion  of  the  region  from  -69  to 
- 1  resulted  in  complete  loss  of  the  promoter  activity.  Thus,  the 
region  from  -69  to  -49  upstream  of  the  translation  start  site 
probably  contains  the  critical  elements  for  the  promoter  in  the 
5'-UTR  sequence  of  eIF4G.  Interestingly,  this  region  corre¬ 
sponds  to  the  PPT  region  that  was  shown  previously  to  be 
critical  for  the  IRES  activity  of  eIF4G  (20,  21)  (see  Fig.  5 A  for 


the  sequence). 

Linker-scanning  analysis  of  the  cryptic  promoter  in  the  5'- 
UTR  of  eIF4G.  To  identify  potential  transcription  factor  bind¬ 
ing  sites  required  for  the  cryptic  promoter  activity  in  the  5'- 
UTR  of  eIF4G,  we  engineered  a  series  of  linker-scanning 
constructs  near  the  translation  start  site.  Linker-scanning  con¬ 
structs  were  engineered  into  the  full-length  5'-UTR  sequence 
by  site-directed  mutagenesis  in  which  10  successive  nucleotides 
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FIG.  6.  Identification  of  nuclear  proteins  binding  to  the  regions  from  -151  to  -132,  -101  to  -80,  and  -68  to  -48  in  the  5'-UTR  of  eIF4G. 
(A)  EMSA  and  supershift  analysis  of  a  probe  with  the  sequence  from  -151  to  -132.  HeLa  nuclear  extract  (N.  E.)  (10  p-g)  was  incubated  with  the 
probe  alone  (lane  2)  or  in  the  presence  of  unlabeled  Spl  (lane  3),  mutant  Spl  (lane  4)  oligonucleotide  as  competitors  or  in  the  absence  (lane  5) 
or  presence  of  anti-Spl  (lane  6),  anti-Sp3  (lane  7),  both  anti-Spl  and  anti-Sp3  (lane  8),  or  anti-C/EBPp  (lane  9)  antibodies  (Ab)  for  the  supershift 
assay.  The  DNA-protein  complexes  were  separated  by  PAGE.  The  DNA-protein  complexes  generated  are  indicated  by  arrows.  The  gel  of  lanes 
5  to  9  was  run  longer  than  the  gel  of  lanes  1  to  4  to  better  separate  the  protein  complexes.  (B)  EMSA  and  UV  cross-linking  of  a  probe  with  the 
sequence  from  -101  to  -80.  HeLa  nuclear  extract  (10  pg)  was  incubated  with  the  probe  alone  (lane  2)  or  in  the  presence  of  the  unlabeled  probe 
(lane  3)  or  Spl  oligonucleotide  (lane  4)  as  competitors.  Cross-linking  of  DNA-protein  complexes  was  performed  by  incubating  10  pg  of  nuclear 
extract  with  the  labeled  probe  followed  by  UV  irradiation  and  separation  by  SDS-PAGE  (lane  5).  (C)  EMSA,  supershift,  and  UV  cross-linking 
of  a  probe  with  the  sequence  from  -68  to  -48.  HeLa  nuclear  extract  (10  pg)  was  incubated  with  the  probe  alone  (lane  2),  or  in  the  presence  of 
the  unlabeled  probe  (lane  3),  wild-type  C/EBP|3  (lane  4)  or  mutant  C/EBPp  (lane  5)  oligonucleotides  as  competitors,  or  in  the  absence  (lane  6) 
of  presence  (lane  7)  of  anti-C/EBP(3  antibody  for  the  supershift  assay.  The  DNA-protein  complexes  were  separated  by  PAGE.  UV  cross-linking 
was  performed  in  the  way  same  as  described  for  panel  B,  with  the  probe  from  -68  to  -48.  (D)  Sequence  comparison  of  the  consensus  C/EBPp 
binding  site  with  the  probe  from  —68  to  -48.  Also  shown  are  the  wild-type  (w)  and  mutant  (m)  C/EBPp  oligonucleotides  used  for  the  EMSA  in 
panel  C.  Conserved  nucleotides  are  aligned  by  three  dots.  N  =  A,  C,  G,  or  T.  The  PPT  of  eIF4G  is  underlined.  Lane  1  in  all  experiments  (panels 
A  to  C)  is  a  control  consisting  of  the  probe  alone  incubated  without  nuclear  extract. 


were  replaced  with  the  sequence  ACTCTAGACT  (Fig.  5A). 
This  sequence  was  chosen  because  it  contains  no  sequence 
homologous  to  any  known  transcription  factor  binding  sites. 
These  constructs  were  then  transfected  into  HeLa  cells,  and 
the  promoter  activity  was  measured  using  cell  lysate  following 
transfection.  As  shown  in  Fig.  5B,  replacement  with  L4,  L5,  or 
L6  (corresponding  to  -79  to  -49  upstream  of  the  translation 
start  site)  resulted  in  significant  decreases  in  promoter  activity. 
This  observation  was  consistent  with  the  results  of  the  deletion¬ 
mapping  study  shown  in  Fig.  4B  (construct  D69-1).  However, 
the  results  in  Fig.  4B  (compare  the  results  for  constructs  D368- 
168,  D368-118,  and  D368-68)  also  suggested  that  this  region 
(—79  to  -49)  itself  did  not  contain  the  full  promoter  activity. 
To  identify  other  cis  elements,  we  next  engineered  the  same 
linker-scanning  constructs  into  the  D368-168  truncated  con¬ 
structs  since  it  contains  minimal  sequence  for  the  full  promoter 


activity.  As  shown  in  Fig.  5C,  two  additional  regions,  —149  to 
-139  (L13)  and  -89  to  -69  (L7  and  L6),  were  shown  to  be 
important  for  the  proximal  promoter  activity. 

Identification  of  transcription  factors  for  the  cryptic  pro¬ 
moter  in  the  5'-UTR  of  eIF4G.  The  above  results  suggest  that 
there  are  several  critical  regions  in  the  cryptic  promoter  of  the 
5'-UTR  of  eIF4G.  To  determine  the  biochemical  composition 
of  protein  complexes  binding  to  these  regions,  nuclear  extracts 
from  HeLa  cells  were  analyzed  by  EMSA  using  labeled  probes 
corresponding  to  regions  from  —151  to  —132,  —101  to  —80, 
and  —68  to  —48.  These  sequences  were  chosen  because  they 
are  overlapped  with  the  critical  regions  identified  by  promoter- 
mapping  analysis  shown  in  Fig.  4  and  5.  In  addition,  sequence 
analysis  (57)  showed  that  they  contain  Spl/Sp3,  Ets-1,  and 
C/EBPp  binding  sites,  respectively  (Fig.  5A).  As  shown  in  Fig. 
6A,  the  probe  from  -151  to  —132  formed  three  protein  com- 
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plexes  in  EMSA  (lanes  2  and  5).  Addition  of  cold  Spl  oligo¬ 
nucleotides  prevented  the  formation  of  the  three  complexes 
(lane  3).  In  contrast,  addition  of  cold  mutant  Spl  oligonucle¬ 
otides  had  no  effect  on  the  complex  formation  (lane  4).  Fur¬ 
thermore,  addition  of  Spl  antibody  to  the  reaction  mixture 
inhibited  the  formation  of  the  uppermost  complex  (lane  6) 
while  addition  of  Sp3  antibody  inhibited  the  formation  of  the 
two  lower  complexes  (lane  7).  Addition  of  both  antibodies 
completely  inhibited  the  formation  of  all  three  complexes  (lane 
8),  while  addition  of  C/EBP(3  antibody  had  no  effect  (lane  9). 
These  results  clearly  indicate  that  the  region  from  -151  to 
-132  contains  a  typical  Spl/Sp3  binding  site. 

The  region  from  -101  to  -80  was  predicted  to  contain  an 
Ets-1  binding  sequence  (Fig.  5A).  EMSA  analysis  showed  that 
two  specific  protein  complexes  were  formed  (Fig.  6B,  lane  2). 
Addition  of  cold  probe  inhibited  the  formation  of  these  com¬ 
plexes  (lane  3),  while  addition  of  nonrelevant  cold  Spl  probe 
had  no  effect  on  complex  formation  (lane  4).  As  shown  by  UV 
cross-linking  (lane  5),  at  least  six  major  proteins  with  different 
molecular  weights  bound  to  this  sequence,  indicating  that  this 
sequence  is  recognized  by  a  large  number  of  different  proteins. 
The  identities  of  these  proteins  remain  to  be  determined. 

The  region  from  -68  to  -48  was  predicted  to  contain  a 
C/EBPp  binding  sequence.  As  shown  in  Fig.  6D,  C/EBPp  bind¬ 
ing  sequence  involves  12  bases  with  a  core  sequence  of  (A/G) 
N(A/G)T(T/G)NNG(A/C)AA(T/G).  The  homologous  se¬ 
quence  of  C/EBP-p  in  the  5'-UTR  of  eIF4G  is  located  in  the 
region  from  -61  to  -50  that  overlaps  the  PPT.  EMSA  analysis 
showed  that  a  single  specific  protein  complex  was  formed  with 
this  probe  (Fig.  6C,  lane  2).  This  complex  is  specific,  since 
addition  of  cold  probe  prevented  the  formation  of  this  complex 
(lane  3).  The  addition  of  cold  probe  corresponding  to  the 
consensus  C/EBPp  binding  sequence  also  competed  for  the 
formation  of  the  complex  (lane  4).  In  contrast,  a  cold  mutant 
C/EBPp  probe  had  no  effect  on  complex  formation  (lane  5). 
Hence,  the  protein  involved  in  binding  to  the  sequences  of  -68 
to  -48  is  probably  C/EBPp.  This  conclusion  was  further  con¬ 
firmed  using  C/EBPp  antibody,  which  inhibited  the  formation 
of  the  complex  (lane  7).  UV  cross-linking  analysis  also  showed 
a  single  protein  with  the  same  molecular  weight  as  C/EBPp 
(lane  8).  Taken  together,  these  results  demonstrated  that  the 
region  from  -68  to  -48  contains  a  C/EBPp  binding  sequence. 

Transcription  factors  Spl  and  Ets-1  synergistically  trans- 
activate  the  activity  of  eIF4G  promoter  in  SL2  cells.  To  deter¬ 
mine  directly  whether  the  above  transcription  factors  could 
functionally  modulate  the  promoter  activity  in  the  5'-UTR 
sequence  of  eIF4G,  Drosophila  SL2  cells,  which  are  deficient  in 
Spl-,  Sp3-,  and  Ets-related  proteins  (12,  13),  were  used.  The 
reason  for  using  insect  instead  of  mammalian  cells  is  that  Spl-, 
Sp3-,  and  Ets-related  factors  are  expressed  in  virtually  all 
mammalian  cells,  which  could  affect  the  interpretation  of  this 
experiment.  We  introduced  the  pR-eIF4G-F(-P)  construct 
along  with  Drosophila  expression  plasmid  pPacSpl  or  pPac- 
UEts-1  into  Drosophila  SL2  cells.  As  shown  in  Fig.  7A,  both 
pPacSpl  and  pPacUEts-1  stimulated  the  promoter  activity  in 
the  5'-UTR  sequence  of  eIF4G.  To  test  the  possible  functional 
interplay  between  the  Spl  -  and  Ets-related  proteins,  we  per¬ 
formed  cotransfection  experiments  using  combinations  of  plas¬ 
mids  expressing  Ets-1  and  Spl.  As  shown  in  Fig.  7A,  pPacSpl 
and  pPacUEts-1  synergistically  stimulated  the  promoter  activ- 


FIG.  7.  Spl  and  Ets-1  proteins  activate  the  promoter  in  the  5'- 
UTR  sequence  of  eIF4G  in  Drosophila  SL2  cells.  pR-eIF4G-F(-P) 
(A)  and  pR-eIF4G-F(-PE)  (B)  were  transfected  into  SL2  cells  to¬ 
gether  with  Drosophila  expression  vector  (PacO),  or  plasmids  express¬ 
ing  transcription  factors  Spl  and  Ets-1.  At  48  h  following  transfection, 
the  cells  were  harvested  for  determination  of  firefly  (F.)  luciferase 
activity,  which  was  normalized  to  that  of  vector-transfected  cells.  The 
data  are  from  three  independent  experiments. 


ity  in  the  5'-UTR  sequence  of  eIF4G.  Since  our  promoterless 
construct  pR-eIF4G-F(-P)  contains  an  SV40  enhancer  down¬ 
stream  of  the  poly(A)  signal  sequence  that  may  complicate  the 
interpretation  of  the  above  results  (Fig.  3A),  we  removed  the 
SV40  enhancer  region  from  pR-eIF4G-F(-P)  by  cloning  its 
Nhe\-Hpa\  fragment  into  the  pGL3-promoter  vector,  which 
contains  intact  SV40  poly(A)  signal  but  without  SV40  en¬ 
hancer.  The  same  pattern  of  results  was  observed  (Fig.  7B). 
Thus,  the  trans  activation  of  promoter  by  Spl  and  Ets-1  occurs 
through  the  cis  elements  present  in  the  5'-UTR  sequence  of 
eIF4G. 

Analysis  of  5'-UTRs  of  Sno,  Bad,  and  HIAP  by  conventional 
and  promoterless  dicistronic  test.  To  determine  how  robust 
the  promoterless  dicistronic  test  is  in  differentiating  cryptic 
promoters  from  IRES  activities,  we  analyzed  three  additional 
cellular  5'-UTRs  by  using  both  conventional  and  promoterless 
dicistronic  tests  (Fig.  8A).  The  sno  gene  encodes  a  component 
of  the  histone  deacetylase  complex  and  can  act  as  a  tumor 
suppressor  in  mice  (67),  although  paradoxically  it  also  pro¬ 
motes  oncogenic  transformation  of  chicken  embryo  fibroblasts 
and  differentiation  of  quail  embryo  fibroblasts  (6).  The  human 
sno  gene  has  a  5'-UTR  of  700  bases  that  is  rich  in  AT.  Bad  is 
a  member  of  the  Bcl-2  family  of  proapoptotic  proteins  that  is 
thought  to  exert  a  death-promoting  effect  by  blocking  the  pro- 
survival  function  of  Bcl-XL  through  its  heterodimerization  with 
Bcl-XL  (7).  The  mouse  Bad  mRNA  has  a  5'-UTR  of  470  bases 
which  contains  a  small  upstream  open  reading  frame  encoding 
a  peptide  of  60  amino  acids.  Human  HIAP  is  a  member  of  the 
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FIG.  8.  Conventional  and  promoterless  dicistronic  test  of  the  5'-UTRs  of  Sno,  Bad,  and  HIAP.  (A)  Schematic  diagram  of  conventional  and 
promoterless  dicistronic  constructs  of  the  5'-UTRs  of  Sno,  Bad  and  HIAP.  (B)  Conventional  dicistronic  and  promoterless  dicistronic  constructs 
were  transfected  into  HeLa  cells.  At  24  h  following  transfection,  cells  were  harvested  for  enzyme  assay  and  the  ratio  of  Renilla  (R.)  to  firefly  (F.) 
luciferase  activity  was  determined  and  normalized  to  that  of  the  respective  vector  controls.  The  data  are  from  three  independent  experiments. 


LAP  protein  family  that  is  evolutionally  conserved  and  can 
block  apoptosis  when  expressed  in  cells  derived  from  multiple 
tissues  (14).  HIAP  mRNA  has  a  5'-UTR  of  ~3  kb,  which 
would  severely  inhibit  cap-dependent  translation  initiation 
(33). 

As  shown  in  Fig.  8B,  the  5'-UTRs  of  human  Sno  and  mouse 
Bad  both  significantly  increased  the  expression  of  the  second 
cistron  in  the  conventional  dicistronic  test  (open  bars),  sug¬ 
gesting  that  these  5'-UTRs  have  either  IRES  and/or  cryptic 
promoter  activity.  In  contrast,  no  increase  in  the  second-cistron 
expression  was  observed  with  the  5'-UTR  of  human  HIAP, 
indicating  that  it  does  not  contain  either  IRES  or  cryptic  pro¬ 
moter  activity.  Interestingly,  the  5'-UTRs  of  Sno  and  Bad 
stimulated  even  higher  expression  of  the  second  cistron  in  the 
promoterless  dicistronic  vector  (solid  bars).  This  observation 
strongly  argues  that  the  enhanced  expression  of  the  second 
cistron  observed  in  the  conventional  dicistronic  test  is  due  to 
the  promoter  rather  than  IRES  activity  in  the  5'-UTRs  of  Sno 
and  Bad. 

Northern  blot  analysis.  To  determine  whether  the  transcript 
derived  from  the  cryptic  promoter  of  eIF4G,  Sno,  and  Bad  can 
be  detected  by  Northern  blotting  using  the  firefly  luciferase 
gene  as  a  probe,  poly(A)  RNAs  were  isolated  for  Northern 
blot  analysis  48  h  following  transfection  of  dicistronic  con¬ 
structs  pRF,  pR-HRV-F,  pR-eIF4G-F,  pR-Sno-F,  pR-Bad-F, 
and  pR-eIF4G-F(-P).  As  a  control  for  monocistronic  tran¬ 
script,  the  pRF(— R)  construct  was  engineered  by  removing 
DNA  sequences  encoding  Renilla  luciferase  (the  0.95-kb 
EcoRV-Pvull  fragment  [Fig  1A]).  This  plasmid  uses  the  SV40 
promoter  to  direct  the  synthesis  of  the  firefly  luciferase  tran¬ 
script  without  the  Renilla  luciferase  sequence.  As  shown  in  Fig 
9A,  the  dicistronic  transcript  from  control  pRF  (lane  1)  and 
the  monocistronic  transcript  from  control  pRF(-R)  (lane  4) 
were  detected  as  expected.  The  transcript  produced  from  pR- 
eIF4G-F(— P)  has  a  size  similar  to  that  from  pRF(— R)  (com¬ 
pare  lanes  4  and  7)  as  expected,  suggesting  that  it  is  a  mono¬ 
cistronic  mRNA.  The  same  monocistronic  transcript  was  also 
observed  with  pR-eIF4G-F  (lane  3),  These  observations  con¬ 
firm  that  the  5'-UTR  of  eIF4G  has  a  strong  promoter  activity 
for  expressing  the  second  cistron  of  firefly  luciferase.  Surpris¬ 
ingly,  no  dicistronic  transcript  was  found  with  pR-eIF4G-F 
(lane  3). 

To  determine  whether  the  lack  of  intact  dicistronic  mRNA 
from  pR-eIF4G-F  was  due  to  the  low  sensitivity  of  Northern 
blot  analysis,  lysates  were  prepared  for  the  Renilla  luciferase 
activity  assay  from  the  same  cells  that  were  used  for  the  North¬ 


ern  blot  analysis  in  Fig.  9A.  As  shown  in  Fig.  9B,  pR-eIF4G-F 
displayed  about  12%  of  the  Renilla  luciferase  activity  of  pRF  or 
pR-HRV-F  (compare  column  3  with  columns  1  and  2),  sug¬ 
gesting  that  the  level  of  the  dicistronic  transcript  containing 
Renilla  luciferase  gene  is  relatively  low.  This  low  level  of  Re¬ 
nilla  luciferase  activity  was  consistently  observed  for  pR- 
eIF4G-F,  and  the  low  level  of  the  dicistronic  transcript  is  prob¬ 
ably  due  to  alternative  splicing  events  as  proposed  previously 
(see  Discussion).  This  event  would  splice  out  Renilla  luciferase 
sequences  from  the  dicistronic  transcripts  and  generate  a 
monocistronic  transcript  that  encodes  only  firefly  luciferase. 
The  minor  transcript,  with  a  size  slightly  larger  than  the  mono¬ 
cistronic  transcript  generated  from  pR-eIF4G-F  (Fig.  9A,  lane 
3),  may  represent  the  alternatively  spliced  products. 

Intact  dicistronic  mRNAs  were  produced  from  pR-HRV-F 
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FIG.  9.  Northern  blot  analysis  of  eIF4G,  Sno,  and  Bad.  (A) 
Poly(A)  RNAs  were  isolated  following  transfection  with  pRF  (lane  1), 
pR-HRV-F  (lane  2),  pR-eIF4G-F  (lane  3),  pRF(-F)  (lane  4),  pR- 
Sno-F  (lane  5),  pR-Bad-F  (lane  6),  and  pR-eIF4G-F(— P)  (lane  7)  and 
used  for  Northern  blot  analysis  as  described  in  Materials  and  Methods. 
The  asterisks  indicate  the  dicistronic  transcript,  the  arrowhead  indicate 
the  monosistronic  transcript  and  the  pound  sign  indicates  the  minor 
transcript.  (B)  Small  fraction  of  the  cells  transfected  with  various 
constructs  for  Northern  analysis  in  panel  A  were  used  to  prepare  cell 
lysates  for  determination  of  both  firefly  and  luciferase  activities. 
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(lane  2),  pR-Sno-F  (lane  5),  and  pR-Bad-F  (lane  6)  with  the 
expected  sizes.  No  monocistronic  transcripts  were  detected 
with  these  constructs.  These  observations  are  consistent  with 
the  conclusion  that  the  HRV  sequence  is  an  IRES  element 
that  does  not  contain  a  cryptic  promoter.  However,  the  lack  of 
detectable  monocistronic  transcript  from  Sno  and  Bad  would 
have  suggested  that  they,  too,  do  not  have  cryptic  promoters, 
had  we  not  performed  the  promoterless  dicistronic  test  (Fig. 
8).  To  determine  whether  the  firefly  luciferase  was  expressed  in 
the  cells  used  for  Northern  blot  analysis,  we  measured  the 
firefly  luciferase  activity.  As  shown  in  Fig.  9B,  the  firefly  lucif¬ 
erase  activity  of  Sno  and  Bad  is,  respectively,  —16-  and  —9-fold 
higher  than  that  generated  from  pRF  and  similar  to  that  of 
pR-HRV-F.  Based  on  these  observations,  we  conclude  that 
Northern  blot  analysis  is  not  sensitive  enough  to  detect  mono¬ 
cistronic  transcripts  generated  from  moderate  cryptic  promot¬ 
ers  and  thus  the  promoterless  dicistronic  assay  is  a  better 
approach  to  confirm  the  existence  of  IRES  in  5'-UTR  se¬ 
quences.  It  is  noteworthy  that  two  unknown  bands  of  high 
mobility  were  also  detected  in  all  samples  on  Northern  blot¬ 
ting.  The  similar  products  have  also  been  observed  previously 
(11).  However,  because  they  are  smaller  than  the  monocis¬ 
tronic  mRNA,  they  are  not  expected  to  be  translated  into 
full-length  firefly  luciferase  proteins  and  do  not  affect  the  in¬ 
terpretation  of  the  data. 

DISCUSSION 

The  recent  rapid  increase  in  the  number  of  cellular  IRES 
elements  discovered  has  raised  great  interest  in  the  field  of 
gene  regulation  research  (27,  61,  71).  However,  because  the 
conventional  dicistronic  test  inherits  inevitable  drawbacks  and 
the  dicistronic  RNA  assays  do  not  work  for  cellular  IRES 
elements,  a  debate  still  exists  about  the  validity  of  cellular 
IRESs,  although  viral  IRESs  are  well  established  (36,  42,  63). 
In  this  study,  we  created  and  tested  a  promoterless  dicistronic 
vector  which  can  be  used  easily  to  safeguard  the  claims  of 
cellular  IRES  in  future  studies. 

One  of  the  more  than  40  cellular  mRNAs  that  have  been 
shown  to  have  IRES  activities  to  date  is  eIF4G  (see  http:// 
www.rangueil.inserm.fr/IRESdatabase).  As  a  major  subunit  of 
eukaryotic  translation  initiation  machinery,  eIF4G  plays  a  ma¬ 
jor  role  in  cell  growth  regulation  and  cell  transformation  (2, 18, 
26).  However,  regulation  of  eIF4G  expression  is  poorly  under¬ 
stood.  Two  different  isoforms  of  cDNA  for  eIF4G  have  been 
isolated  (35,  37,  74).  The  first  sequence  has  a  5'-UTR  of  368 
bases  which  contains  an  active  IRES  (20,  21).  The  IRES  ac¬ 
tivity  of  this  5'-UTR  has  been  well  characterized  and  mapped 
to  a  polypyrimidine  tract  upstream  of  the  AUG  start  codon 
(20).  The  second  cDNA  sequence  (extended  eIF4G)  encodes  a 
protein  that  has  156  more  amino  acids  at  the  amino  terminus 
than  the  first  one  and  contains  a  new  5'-UTR,  which  also 
contains  IRES  activity,  although  the  activity  is  much  lower  than 
that  of  the  first  5'-UTR  (35,  37).  Interestingly,  both  the  trun¬ 
cated  and  the  extended  versions  of  eIF4G  function  similarly  in 
promoting  protein  synthesis  and  cel!  transformation  (18,  26). 
Currently,  it  is  not  known  which  protein  isoforms  are  expressed 
in  cells.  The  5'-UTR  of  the  truncated  eIF4G  (74)  was  thought 
to  be  an  intron  of  the  extended  eIF4G  gene  (23,  37).  Analysis 
of  the  human  genome  sequence  shows  that  the  5'-UTR  is 


located  in  intron  4  of  the  extended  eIF4G  gene  (unpublished 
observation).  In  this  study,  we  showed  that  the  5'-UTR  of  the 
truncated  eIF4G  clearly  contains  promoter  sequence  which 
may  be  used  to  generate  transcripts  for  expression  of  truncated 
eIF4G.  It  is  therefore  tempting  to  speculate  that  at  least  two 
alternative  promoters  may  exist  before  exon  5  to  generate 
transcripts  with  various  length  of  the  5'-UTR  responsible  for 
production  of  the  truncated  eIF4G  protein.  Indeed,  sequence 
analysis  of  the  eIF4G  genome  showed  that  there  may  be  an¬ 
other  promoter  in  the  GC-rich  region  about  1.6  kb  upstream  of 
the  5'-UTR  of  truncated  eIF4G  (unpublished  observation). 

The  promoter  activity  associated  with  the  5'-UTR  of  eIF4G 
in  this  study  was  probably  thought  previously  to  be  an  IRES 
activity.  First,  expression  of  firefly  luciferase  of  the  second 
cistron  driven  by  the  5'-UTR  of  eIF4G  in  a  promoterless 
dicistronic  test  is  at  the  same  level  as  that  observed  in  a  con¬ 
ventional  dicistronic  test.  Second,  the  cryptic  promoter  se¬ 
quence  has  been  mapped  and  the  transcription  factor  binding 
sites  in  the  DNA  encoding  the  5'-UTR  have  been  delineated. 
Third,  some  of  the  transcription  factors  which  are  responsible 
for  promoter  activation  of  the  5'-UTR  of  eIF4G  have  also 
been  identified  using  EMSA  and  complementation  assays  in 
insect  cells.  Fourth,  the  Northern  blot  analysis  demonstrated 
that  monocistronic  transcripts  encoding  firefly  luciferase  were 
produced  from  both  the  conventional  and  the  promoterless 
dicistronic  plasmid.  Finally,  as  another  alternative  strategy  to 
bypass  the  complex  issue  of  transcription  or  splicing  possibly 
presented  by  5'-UTRs  in  the  conventional  dicistronic  DNA 
test,  we  also  performed  RNA  transfection  and  in  vitro  trans¬ 
lation  using  dicistronic  RNA  transcripts.  However,  the  5'-UTR 
of  eIF4G  failed  to  show  any  IRES  activity.  In  contrast,  the 
IRES  of  HRV  in  dicistronic  RNAs  can  direct  the  translation  of 
the  second  cistron  both  in  HeLa  cells  and  in  RRL  supple¬ 
mented  with  HeLa  extract.  Although  similar  observations  have 
been  made  with  some  other  cellular  IRESs  in  previous  studies 
(8,  70),  this  observation  significantly  undermines  the  claim  of 
cellular  IRES  activity  despite  the  possibility  that  nuclear  expe¬ 
rience  of  the  transcripts  other  than  transcription  or  splicing 
may  contribute  to  the  prerequisites  for  internal  initiation  me¬ 
diated  by  the  cellular  IRESs  (70). 

Interestingly,  we  showed  that  the  promoter  activity  in  the 
5'-UTR  sequence  of  eIF4G  relies  on  the  existence  of  the 
region  from  -68  to  -48,  which  corresponds  to  the  PPT  that 
was  previously  described  to  be  essential  for  IRES  activity  of 
eIF4G.  Either  deletion  or  point  mutation  of  this  region  re¬ 
sulted  in  dramatic  loss  of  the  promoter  activity.  This  region 
binds  to  transcription  factor  C/EBP(3,  a  member  of  the 
CCAAT/enhancer  binding  protein  (C/EBP)  family  that  is  ex¬ 
pressed  in  proliferating  cells  (46,  51).  Although  essential,  the 
C/EBPp  binding  element  itself  displays  minimal  promoter  ac¬ 
tivity.  We  also  showed  by  deletion  and  linker-scanning  analysis 
that  the  regions  from  -168  to  -68,  -151  to  -132,  and  -101 
to  -80  are  also  important  elements  for  this  promoter  activity. 
While  we  demonstrated  that  the  region  from  —151  to  -132 
binds  the  Spl/Sp3  transcription  factor,  more  work  is  needed  to 
identify  the  proteins  that  bind  to  the  region  from  -101  to  -80, 
which  was  shown  to  be  highly  homologous  to  the  Ets-1  con¬ 
sensus  binding  sequence.  This  region  binds  to  at  least  six  pro¬ 
teins  in  HeLa  nuclear  extracts,  which  remain  to  be  identified. 
Since  different  Ets  proteins  exhibit  low  selectivity  in  binding- 
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site  preference  (66,  72)  and  since  Ets-1  proteins  do  not  exist  in 
Drosophila  SL2  cells,  we  analyzed  the  possible  effect  of  Ets-1 
on  the  promoter  activity  in  the  5'-UTR  sequence  of  eIF4G  in 
SL2  cells.  Our  results  clearly  demonstrated  that  Ets-1  could 
trans-activate  the  promoter  activity  in  the  5'-UTR  sequence  of 
eIF4G.  We  also  demonstrated  that  Spl  and  Ets-1  work  syner- 
gistically  on  this  promoter  and  stimulate  the  promoter  activity 
by  about  10  to  20-fold.  The  apparent  disparity  between  this 
number  and  the  nearly  1,000-fold  expression  noted  in  Fig.  2 
suggests  that  other  transcription  factors,  C/EBP(3  in  particular, 
may  also  play  a  major  role  in  the  activation  of  the  5'-UTR 
promoter.  Based  on  the  results  of  our  promoter  analysis,  we 
conclude  that  the  transcription  factors  C/EBPp,  Spl/Sp3,  and 
Ets-1  functionally  interact  to  trans-activate  the  cryptic  pro¬ 
moter  in  the  5'-UTR  sequence  of  eIF4G  and  therefore  may  be 
responsible  for  the  transcriptional  regulation  of  the  truncated 
version  of  eIF4G. 

Interestingly,  pR-eIF4G-F  generated  an  additional  minor 
monocistronic  transcript  (Fig.  9A)  that  is  slightly  larger  than 
the  normal  monocistronic  transcript  (Fig.  9A),  generated  by 
the  cryptic  promoter  in  pR-eIF4G-F(— P).  This  transcript  is 
unlikely  to  be  derived  from  transcription  by  the  promoter  in 
the  5'-UTR  sequence  of  eIF4G  because  it  is  not  produced  by 
pR-eIF4G-F(-P).  It  is  probably  generated  by  other  mecha¬ 
nisms  related  to  the  5'-UTR  sequence  of  eIF4G  because  such 
a  transcript  was  not  observed  with  the  vector  control  and  other 
constructs  containing  the  5'-UTR  of  HRV,  Bad,  and  Sno.  One 
such  possible  mechanism  is  alternative  splicing.  The  PPT  re¬ 
gion  in  the  5'-UTR  of  eIF4G  is  followed  by  a  perfect  splicing 
acceptor  sequence  (  ITTCITl'CCCCAGA)  which  has  been 
suggested  to  be  used  for  splicing  to  generate  the  extended  form 
of  eIF4G  (23).  Use  of  this  acceptor  sequence  and  the  upstream 
donor  sequence  in  the  chimeric  intron  (see  Fig.  1)  would  result 
in  the  production  of  a  transcript  without  the  Renilla  luciferase 
gene  sequence.  Furthermore,  although  the  dicistronic  tran¬ 
script  was  not  detected  by  Northern  blot  analysis,  pR-eIF4G-F 
produced  about  10%  Renilla  luciferase  activity  compared  to 
other  constructs  that  produced  the  dicistronic  transcripts  (Fig. 
9),  suggesting  that  about  10%  of  the  dicistronic  transcripts 
exist  in  the  cells  transfected  with  pR-eIF4G-F.  The  other  90% 
of  the  dicistronic  transcripts  may  have  been  processed  to 
smaller  transcripts  by  alternative  splicing.  Based  on  the  inten¬ 
sity  of  the  bands  on  Northern  blots  (Fig.  9A),  the  minor  mono¬ 
cistronic  transcript  produced  by  alternative  splicing  (lane  3)  is 
about  one-third  as  common  as  that  produced  by  transcription 
(lane  3)  using  the  promoter  in  the  5'-UTR  sequence  of  eIF4G. 
Therefore,  alternative  splicing  may  represent  another  mecha¬ 
nism  responsible  for  the  apparently  high  firefly  luciferase  ac¬ 
tivity  observed  with  the  5'-UTR  sequence  of  eIF4G  in  the 
conventional  dicistronic  test.  In  light  of  these  findings,  it  is 
imperative  to  rule  out  the  alternative  splicing  as  well  as  the 
cryptic  promoter  when  examining  a  candidate  IRES  of  cellular 
mRNAs,  possibly  by  testing  with  an  intronless  dicistronic  con¬ 
struct. 

Heterogeneity  of  transcription  initiation  has  been  docu¬ 
mented  previously  for  some  proto-oncogenes  such  as  c -myc 
(47,  48).  Transcription  of  c -myc  involves  P0,  PI,  P2,  and  P3 
promoters.  Differential  promoter  usage  has  been  observed  in  a 
variety  of  cell  types,  and  atypical  usage  of  the  promoter  has 
generally  been  associated  with  abnormal  or  deregulated  con¬ 


trol  of  cell  growth.  The  transcription  starting  point  of  P0,  PI, 
and  P2  c -myc  mRNAs  are  located  1,172,  524,  and  363  bases 
upstream  from  the  translation  start  codon  CUG,  respectively, 
while  that  of  P3  resides  in  an  intron.  PI  and  P2  promoter 
sequences  are  therefore  located  in  the  5'-UTR  of  P0  tran¬ 
scripts.  Similarly,  the  1,038  bases  of  the  5'-UTR  sequence  of 
vascular  endothelial  growth  factor  were  also  demonstrated  to 
contain  an  alternative  transcription  initiation  site  (1). 

While  it  is  recognized  that  the  long  5'-UTRs  present  at  the 
majority  of  the  proto-oncogenes  inhibit  cap-dependent  trans¬ 
lation  (73)  and  that  IRES-mediated  translation  initiation  may 
be  used  for  these  mRNAs,  it  has  not  been  well  appreciated  that 
these  long  5'-UTR  DNA  sequences  may  contain  promoters  for 
transcription  of  a  less  abundant  mRNA  species  with  signifi¬ 
cantly  shorter  5'-UTRs.  These  alternative  transcripts  with 
shorter  5'-UTRs  are  compatible  with  cap-dependent  transla¬ 
tion  initiation  and  thus  do  not  require  an  IRES-mediated 
translation  mechanism.  For  example,  the  major  transcript  of 
platelet-derived  growth  factor  B  chain  (PDGF  B/c-sm)  has  a 
5'-UTR  of  1,023  bases  that  is  transcribed  by  an  upstream 
TATA-containing  promoter  (17,  40,  58).  However,  this  tran¬ 
script  cannot  be  translated  due  to  the  large  size  of  the  5'-UTR 
(34,  58,  59).  Recently,  it  has  been  suggested  that  IRES-medi¬ 
ated  translation  initiation  may  play  an  important  role  in  the 
translational  regulation  of  PDGF  B/c-sw  mRNA  during  cell 
differentiation  (4),  and  the  630-base  sequence  within  the  cen¬ 
tral  portion  of  the  5'-UTR  is  important  for  the  IRES  activity 
(65).  Interestingly,  earlier  studies  of  the  PDGF  B/c sis  gene 
showed  that  an  additional  transcript  with  a  5'-UTR  of  only  15 
bases  was  produced  in  cultured  cells  on  phorbol  myristate 
acetate  or  transforming  growth  factor  (31  stimulation  (17)  and 
in  developing  rat  brain  (62).  The  production  of  this  shorter 
transcript  was  postulated  to  derive  from  internal  transcription 
initiation,  as  suggested  by  DNase  I  hypersensitivity  analysis  of 
the  first  exon  sequences  (17).  Sequence  analysis  indeed  re¬ 
vealed  multiple  Spl  binding  sites  in  the  5'-UTR  sequence  of 
PDGF  B/c  -sis  (reference  63  and  unpublished  observations).  It 
is  therefore  possible  that  the  alternative  transcription  for  a 
shorter  and  less  abundant  5'-UTR  may  be  important  in  regu¬ 
lating  PDGF  Blc-sis  expression  during  cell  differentiation, 
which  would  undermine  the  reported  IRES  activity  of  the  long 
5'-UTR  of  PDGF  B/c  -sis.  This  possibility  certainly  merits  at¬ 
tention  in  future  studies  of  the  translational  regulation  of  any 
long  cellular  5'-UTRs. 

The  promoterless  dicistronic  vector  created  in  this  study  is 
clearly  a  very  sensitive  approach  to  detect  any  potential  cryptic 
promoters  in  long  cellular  5'-UTRs.  Promoterless  vector  has 
been  used  in  one  earlier  study  for  investigation  of  possible 
IRES  activity  in  the  5'-UTR  from  XIAP  (X-linked  inhibitor  of 
apoptosis  protein)  (32).  The  advantage  of  our  vector  is  that  it 
was  derived  from  parental  dicistronic  vectors,  and  therefore 
data  from  two  sets  of  vectors  are  readily  comparable.  As  we 
have  also  shown  with  the  5'-UTRs  of  Bad,  Sno,  and  HIAP,  this 
vector  should  be  applicable  to  any  cellular  5'-UTRs  and  can  be 
used  as  a  good  control  for  analysis  of  cellular  IRES  elements. 
Despite  the  differences  in  nucleotide  composition,  the  5'- 
UTRs  of  Bad,  Sno,  and  HIAP  are  all  long  and  punctuated  with 
multiple  AUGs  that  are  incompatible  with  the  cap-dependent 
ribosome-scanning  mechanism  of  translation  initiation;  there¬ 
fore,  they  are  potential  candidates  as  IRES  elements.  How- 
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ever,  using  the  promoterless  dicistronic  vector,  we  found  that 
the  5'-UTRs  of  Bad  and  Sno  contain  promoter  activity  at  a 
moderate  level  with  no  IRES  activity  whereas  the  5'-UTR  of 
HIAP  does  not  contain  either  promoter  or  IRES  activities. 
This  observation,  together  with  the  findings  of  eIF4G  in  this 
study,  argues  that  cryptic  promoter  activities  prevail  in  the 
5'-UTR  of  cellular  mRNAs,  which  may  generate  alternative 
transcripts  with  shorter  5'-UTRs  compatible  with  cap-depen¬ 
dent  translation  initiation.  Furthermore,  our  failure  to  detect 
the  less  abundant  monocistronic  transcripts  generated  from 
the  promoter  in  the  5'-UTR  sequence  of  Bad  and  Sno  by 
Northern  blot  analysis  undermines  the  usefulness  of  this 
method  in  IRES  studies.  If  not  carefully  ruled  out  by  the 
promoterless  dicistronic  test,  these  5'-UTRs  would  otherwise 
be  considered  to  have  IRES  activities.  Therefore,  we  propose 
that  the  promoterless  dicistronic  vector  should  be  used  as  a 
control  to  safeguard  the  claim  of  cellular  IRES  in  future  stud¬ 
ies. 
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